Forward Error Correction for DNA Data Storage
Authors
Abstract
We report a substantial capacity increase in storing digital data in synthetic DNA. In principle, synthetic DNA is an ideal medium for archiving digital data over very long periods, because its achievable data density and longevity far exceed those of today’s digital storage media. However, neither the synthesis nor the amplification or sequencing of DNA strands can be performed error-free today or in the foreseeable future. To make synthetic DNA usable as a digital storage medium, specifically tailored forward error correction schemes have to be applied. To realize a DNA data store, we have developed an efficient and robust forward error correction scheme adapted to the DNA channel. We based the design of the required DNA channel model on data from a proof of concept conducted in 2012 by a team from Harvard Medical School [1]. Our forward error correction scheme copes with all error types of today’s DNA synthesis, amplification, and sequencing processes, such as insertion, deletion, and swap errors. In a recent successful experiment, we stored 22 MByte of digital data in synthetic DNA and retrieved it error-free. The observed residual error probability is already of the same order as that of hard disk drives and can easily be improved further. This demonstrates the feasibility of using synthetic DNA as a long-term digital storage medium.
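The three error types named in the abstract can be illustrated with a toy channel simulation. The sketch below is not the authors' channel model or coding scheme: the two-bits-per-nucleotide mapping, the function names, and the error probabilities are all assumptions chosen for illustration only.

```python
import random

BASES = "ACGT"  # assumed mapping: 00 -> A, 01 -> C, 10 -> G, 11 -> T


def bytes_to_dna(data: bytes) -> str:
    """Map each byte to four nucleotides, two bits each, most significant bits first."""
    return "".join(BASES[(b >> shift) & 3] for b in data for shift in (6, 4, 2, 0))


def dna_to_bytes(strand: str) -> bytes:
    """Inverse mapping; trailing nucleotides that do not fill a byte are dropped."""
    out = bytearray()
    usable = len(strand) - len(strand) % 4
    for i in range(0, usable, 4):
        value = 0
        for ch in strand[i:i + 4]:
            value = (value << 2) | BASES.index(ch)
        out.append(value)
    return bytes(out)


def noisy_channel(strand: str, p_ins: float, p_del: float, p_sub: float,
                  rng: random.Random) -> str:
    """Apply independent insertion, deletion, and substitution (swap) errors.

    The probabilities are hypothetical parameters, not measured values.
    """
    out = []
    for base in strand:
        if rng.random() < p_ins:
            out.append(rng.choice(BASES))  # insertion before this base
        r = rng.random()
        if r < p_del:
            continue  # deletion: the base is lost
        if r < p_del + p_sub:
            # substitution: replace with one of the other three bases
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)


if __name__ == "__main__":
    rng = random.Random(0)
    strand = bytes_to_dna(b"DNA storage")
    received = noisy_channel(strand, p_ins=0.01, p_del=0.01, p_sub=0.02, rng=rng)
    print(strand)
    print(received)
```

Insertions and deletions change the strand length and shift every later symbol, which is why a code designed only for substitutions is not sufficient on such a channel.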
Similar resources
Coding over Sets for DNA Storage
In this paper we study error-correcting codes for the storage of data in synthetic DNA. We investigate a storage model where a data set is represented by an unordered set of M sequences, each of length L. Errors within that model are losses of whole sequences and point errors inside the sequences, such as insertions, deletions and substitutions. We propose code constructions which can correct e...
Study of Forward Error Correction using Reed-Solomon Codes
A digital communication system transports an information-bearing signal from the source to a user destination via a communication channel. Channel coding is used to deliver information reliably through the transmission channel to the user. In channel coding, the number of symbols in the source-encoded message is increased in a controlled manner in order to facilitate two basic objecti...
On Burst Error Correction and Storage Security of Noisy Data
Secure storage of noisy data for authentication purposes usually involves the use of error-correcting codes. We propose a new model scenario involving burst errors and present several constructions for it.
A Low Complexity Forward Error Correction for PAPR reduction in OFDM Systems
In this paper, a Peak to Average Power Ratio (PAPR) reduction technique in Orthogonal Frequency Division Multiplexing (OFDM) systems is proposed. This technique uses the error correction capability of channel coding to reduce the PAPR. The coded bits at the input of the OFDM modulator are separated into two groups, the Most Significant Bits (MSB) and the Least Significant Bits (LSB). The bits are mappe...
Efficient and Safe Data Backup with Arrow
We describe Arrow, an efficient, safe data backup system for computer networks. Arrow employs techniques of delta compression (or deduplication) to achieve efficient storage and bandwidth utilization, and collision-resistant hashing and error-correction coding to protect against and correct storage errors. keywords: content-addressable storage; error-correcting storage systems; data backup; ded...